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Abstract 

A problem  studied  by  Flanders  (1975)  is  to  minimize  the  function 
f(R)  = tr(SR  + TR  ^)  over  the  set  of  positive  definite  matrices  R, 
where  S and  T are  positive  semi -.definite  matrices.  Alternative 
proofs  that  may  have  some  intrinsic  interest  are  provided.  The  proofs 
explicitly  yield  the  infimum  of  f(R).  One  proof  is  based  on  a convexity 
argument  and  the  other  on  a sequence  of  reductions  to  a univariate  problem. 

1.  Introduction. 

Flanders  (1975)  studied  a matrix  problem  that  arose  in  eleetric 
circuit  theory.  Let  wl'’’'*wm  comP-Lex  column  vectors 

of  length  n and  consider  the  real -valued  function 

^ "j  -i 

(l)  f(R)  = z,  Rz,  + • • • + z Rz  + w_,R~  w_,  + •••  + w R w , 

11  mmll  mm' 

where  R > 0 denotes  an  n x n positive  definite  Hermitian  matrix.  The 
problem  is  to  minimize  f(R)  over  the  set  of  positive  definite  R. 

If  we  set 

Z = (z-^j . . . , z^)  , W = (w.^, . . . ^w^)  ,> 

S = ZZ*  , T = WW*  , 
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then  (l ) becomes 


* -X-  1 -)f  -X- 

(2)  f (R)  = tr  R(z-.z..  + •••  + zz  ) + tr  R (w,w,  + ...  + w w ) 

' ' 1 1 m m'  1 1 1 mm' 

= tr  R ZZ*  + tr  R_1  WW* 

= tr  R S + tr  R”^  T , 

where  S and  T are  positive  semi -definite  matrices  of  order  n.  We  shall 
write  A > 0 and  A > 0 to  denote  that  A is  positive  semi -definite  and 
positive  definite,  respectively. 

The  result  obtained  by  Flanders  (1975)  is  as  follows: 

Theorem  1.  If  f (R)  is  defined  on  the  set  of  positive  definite  matrices 
by  (l)  or  (2),  and  if  A = Z*W,  then 


rank 


(i)  inf  f(R)  = 2 tr  (AA*)1  ^ = 2 tr(A*A)1//2  , 

R > 0 

* !/2 

(ii)  f(RQ)  = 2 tr(AA  ) for  some  R^  > 0 if  and  only  if 
(Z)  = rank  (W)  = rank  (A)  . 

Flanders  first  proves  that 


f(R)  > 2 tr(AA*) 


1/2 


and  then  discusses  the  approach  to  equality.  However,  the  matrix  R 
that  achieves  equality  (when  the  condition  of  (ii)  is  satisfied)  is  not 
exhibited  in  any  simple  manner.  We  now  provide  alternative  somewhat 
simpler  proofs  of  (i)  that  may  have  some  intrinsic  interest. 
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2. 


Matrices  are  of  full  rank. 


First  Alternative  Proof.  This  proof  is  based  on  th/e  fact  that  f(R)  is 
a convex  function  of  R.  The  function  tr  RS  is  linear  in  R,  and 
tr  R ^T  is  convex  in  R,  i.e., 

tr(aR1  + (l-a)R2)-1  T < a tr  R^T  + (l-a)  tr  R^T,  0 < a < 1 , 

^or  an<i  Rg  positive  definite.  The  inequality  is  strict  unless 

R1  = R2  or  a = 0 or  1-  Consequently,  f(R)  is  (strictly)  convex, 
and  we  need  to  minimize  a convex  function  over  a convex  set.  Since 
f (R)  ->  oo  as  R ■»  0 or  as  R -*oo,  the  minimum  is  achieved  at  an  interior 
point,  namely  where  df (R)/dR  = 0.  But 

df  = tr  S(dR)  - tr  r"1  (dR)R_1T  = tr  dR(S-R~1TR"1)  ; 

so  that  there  is  an  interior  point  R satisfying  S-R^TR-"1  = 0,  or 
equivalently, 

(3)  R S R = T , 

that  is  the  minimizer  of  f (R).  Rote  that  tr  RS  - tr  R-1!,  so  that 

f(R)  = 2 tr  RS  . 


Furthermore,  from  (3), 
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so  that 


s1//2rsrs1//2  = (s1,/2rs1//2  ) = s1//2ts1//2  ; 


S1//2RS1//2 


(S1//2TS1//2 ) 


1/2 


and 

~ ~ l /p  i /p  l/2 

f(R)  = 2 tr  RS  = 2 tr(S  1 TS  ; ) 

Here  we  have  used  the  positive  definite  square  root.  However, 
any  square  root, e.g.,  S = LL  , T = MM  can  be  used,  in  which 

case  the  result  is 

f(R)  = 2 tr(L*MM*L)1//2  - 2 tr (M*LL*M)ly/2  . 

Let  (A)  denote  the  characteristic  roots  of  the  matrix  A. 

tr(LVl)1^2  = Z \._[(L*MM*L)1//2]  = Z[X± (L*MM*L) ]^2 

= z[\±  (mm*ll*)  ]X^2  = z^Ors)]1/2  , 

= tr  (T1//2ST1//2  y1^2  = tr  (S^TS1^2  ^ , 

we  see  that  all  square  roots  yield  the  same  result. 


Becaus  e 
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Second  Alternative  Proof.  This  proof  is  based  on  a sequence  of  reductions 


until  finally  we  obtain  an  extremal  problem  of  distinct  variables.  There 
exists  a nonsingular  matrix  H such  that 


T-1  = HH*  , S = HD  H*  , 


where  is  a diagonal  matrix  with  real  elements  (d^ dn)  ordered 

0 < cL  < • • • < d , which  are  the  roots  of 
1 — — n7 


0 = | S-yT-1 | = | T-1 | • | ST-yl|  = ! T-1 | 


S1/^TS1/“-yI 


Then 


f(R)  = tr  RHD  H*  + tr  R_1H*“1h“1 

= tr  H*RHDd  + tr (H*RH)_1  = tr  GDd  + tr  G_1  , 

where  G = H*RH.  Let  G = QI1  Q*.  where  Q is  unitary  and 

= diag(\n . . . ,\  ) is  diagonal  with  positive  diagonal  elements 

ordered  0 < A,  < • • • < The  function  to  be  minimized  (with 

respect  to  Q and  D.  ) is 

A. 

1 n n 

(4)  tr  ««  D.  + tr  D”  = F dA.Iq.  .\  + £ . 

\ d \ ill1' 

By  a theorem  of  von  Neumann  (1937)  [see  also  Fan  (1951)];  the 

minimum  of  the  first  sum  in  (7 ) with  respect  to  Q is  dA.,  and 

«j  «D‘  J 
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the  minimizing  Q is  I.  Then  the  minimum  of  + ^j=l  ^ j 

-1  /2 

with  respect  to  X^j  • • • is  attained  at  X^  = d..  and  the 

/is  . ~ _n  ,l/2 

minimized  value  of  (4;  xs  2 d. 

Remark.  If  one  or  more  of  the  d..  are  0 , then  the  infimum 

— — • j 

cannot  be  attained. 

The  minimizing  matrix  R is 

R = (H*)_1GH“1  = (h*)"1B“1/2h“1  = (HD^^H*)'1  . 

This  matrix  satisfies  (3)- 

3.  Matrices  not  full  of  rank. 

In  the  case  when  rank(S)  and/or  rank(T)  is  less  than  n we 

show  how  to  reduce  the  problem  to  a canonical  form  from  which  we  may- 
then  invoke  the  result  for  full  rank. 

Note  that  f(R)  = tr  RS  + tr  R-1T  is  invariant  with  respect  to 

the  transformation 

(R,S,T)  -*  (QRQ*,  Q*"1SQ_1 , QTQ* ) 

for  any  nonsingular  matrix  Q.  Furthermore,  the  ranks  of  S,  T and  ST 
are  invariant  under  this  transformation.  Ry  judiciously  choosing  a 
matrix  Q,  we  can  effect  a simplification  of  the  problem. 

Theorem  2.  If  S > 0,  T > 0,  then  there  exists  a nonsingular  matrix 
V such  that 
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(5) 


T = VTV*  = 


■ I 

1 T 

0, 

1 

, s = v*"1sv“1  = | d 

°J 

\ 0 

0 1 

’ 0 

vJ 

where  t = rank(T),  Dd  = diag^, . . . ,dT),  rank(M)  = rank(S)  - rank(ST), 
and  rank(Dd)  = rank(ST). 

Suppose  for  the  moment  that  Theorem  2 holds.  Then 


f (R)  = tr  RV  ! 


i D , 0 ?, 

'*  I d 


‘ 0 M 


“1  -1  ’ ^ \ 1 
V + tr  R V 1 ; j V*-1 

; 0 0 I 


/D . 

0 \ 

- ! I 

°l 

= tr  R 1 

| 

+ tr  R-1  | T 

<0 

M » 

■ 0 

0* 

where  R = VRV*.  Minimization 

of 

f(R)  over  R 

> ( 

minimization  over  R > 0.  Consequently, 


is  equivalent  to 


f(R)  = tr  RL1Dd  + tr  R, + tr^"1)^ 

= tr  + tr  R22fl  + tr(Ru-aL2Sg^1)'1  . 

Since  > tr  f(R)  is  minimized  by  taking 

^12  = m®‘  Ib-e*1 

MLn  f(E)  = Min  [tr  Ld.  + tr  R-1  + tr  R Ml  . 

~ ~ ~ 11  d 11  22 

R12' ’ R11 ’ R22  R11 * ^22 

'Three  rank  cases  need  to  be  considered. 

(i)  Rank(T)  > rank (ST).  Then  one  or  more  of  the  d are  0, 

3 } 

in  which  case  the  infimum  is  not  attained.  (See  discussion  leading 
to  the  remark  in  the  second  alternative  proof. ) 
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(ii)  Rank  (S ) > rank  (ST ).  Then  M / 0 and  the  infimum  of  0 is 
not  attained. 

(iii ) Rank(T)  = rank(S)  = rank(ST).  Then  M = 0 and  rank(Dd)  = t, 
so  that 


inf  f(R)  = inf  (tr  R D + tr  R"^) 

R > 0 R_  > 0 11  d 11 

and  the  problem  has  been  reduced  to  the  case  of  the  one  with  full  rank. 
This  completes  the  proof  of  Theorem  1.  || 

To  prove  Theorem  2 we  use  the  following  two  lemmas. 

Lemma  1.  A nonsingular  matrix 


if  and  only  if  G 2d  = 0 , is  unitary,  and  G 22  is  nonsingular. 

Proof-  Multiply  in  (6)  to  obtain  GUG*1  = IT,  G21G*1  = 0,  from 
which  the  conclusion  follows.  || 

Lemma  2.  Given  Q > 0 there  exists  a G12  and  a nonsingular  G22 
such  that 


S.1G12  + S.2G22  ~ 0 * 
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Proof-  Q has  the  triangular  decomposition 


T T \ 
11  12  i 


0 T 


22 


, Tt  T 

' 11  11 


? rp  j rp 
\ 12  11 


T’  T 
11  12 


rp  t rp  t rp  t m 
12  12  A22  2£ 


The  rows  of  ) sPan  a space  of  dimension  less  than  or  equal 

to  t.  The  n«T  columns  of  (1.2)  can  be  chosen  to  be  orthogonal  to 
. \ 22 

the  rows  of  (T11^T12)  and  linearly  independent  (so  that  G } is  nonsingular).  || 
P£ggf_P.f-  Theorem  2.  Let  U be  any  nonsingular  matrix  such  that 


and  define 


0 

0 


* 

U , 


s = u su  . 


Let  G be  any  nonsingular  matrix  satisfying  Lemma  2 and  define 


S = G*SG 


\ 


G11S11G11 


G11  ('llG12+Gi2G22  ^ 


(G12S11+G22S21)G11  G12  (G11G12+G12G22  ^ + G22  (GllG12+l2G22 } - 


Using  Lemma  2,  the  matrix  G10  can  be  chosen  so  that  S^G-^+S. 
so  that 


12  ~v-  ^.uUumi  cu  oiiao  °ipu'3_2T012G22  = 
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0 


/ Gn  Gn  G 


11  11^11 


\ 


S = 


G22 (S2iGi2+^22G22 ^G22  / 


Since  G-] . is  unitary,  we  may  choose  it  to  diagonalize  S.)  1 . This 
completes  the  proof  of  Theorem  2.  || 


Remark.  In  the  second,  alternative  proof  we  used  the  fact  that  if  A and 
B are  positive  definite,  then  both  can  be  diagonalized  simultaneously  by 
a nonsingular  matrix  W,  i.e.. 


(7) 


A = WW*  , 


* 


B = WD.W 

u 


where  D,  = diag  (0,  , . . . ,0  ),  and  6,  > • • • > 6 >0  are  the  charac- 
0 ° 1^  ' n " 1 — — n — 

-1 

teristic  roots  of  A B.  When  the  hypotheses  of  positive  definiteness 
are  removed  the  simultaneous  decomposition  may  no  longer  be  accomplished 
in  general.  Theorem  2 complements  the  above  result  by  providing  a 
simultaneous  decomposition.  This  can  be  stated  in  a form  parallel  to 
(7). 


Theorem  3-  If  A > 0,  B > 0,  then  there  exists  a nonsingular  matrix 
W such  that 


WAW*  - 


fDa 

0 

0 

0 \ 

/ IT 

0 

0 

0\ 

f 6 

/ 

\ 

f 0 

0 

0 

0 \ 

0 

I 

0 

0 ’ 

WBW*  = 

. o 

0 

I 

0 

\ 0 

0 

0 

0 1 

. 

/ 

\ 

\o 

0 

0 

0 / 

\0 

0 

0 

o / 
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where  = diag  (e^, . . . ,6  T)  are  the  nonzero  characteristic  roots  of 
|B-0A|  = 0. 

Proof . The  result  follows  from  Theorem  2 by  noting  that  if  V is  a 
matrix  satisfying  (5),  then  for  any  nonsingular  matrix  C, 

I 1 0 

W = V 

\ 0 c 

also  satisfies  (5).  Since  M > 0,  there  exists  a C such  that 
CMC*  = diag (1,0).  || 

An  equivalent  form  of  Theorem  3 was  obtained  (but  not  published) 
by  Olkin  (1951).  We  suspect  that  this  result  is  even  older,  but  have 
not  been  able  to  locate  a reference. 
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